5,572 research outputs found
Towards Maximising Openness in Digital Sensitivity Review using Reviewing Time Predictions
The adoption of born-digital documents, such as email, by governments, such as in the UK and USA, has resulted in a large backlog of born-digital documents that must be sensitivity reviewed before they can be opened to the public, to ensure that no sensitive information is released, e.g. personal or confidential information. However, it is not practical to review all of the backlog with the available reviewing resources and, therefore, there is a need for automatic techniques to increase the number of documents that can be opened within a fixed reviewing time budget. In this paper, we conduct a user study and use the log data to build models to predict reviewing times for an average sensitivity reviewer. Moreover, we show that using our reviewing time predictions to select the order that documents are reviewed can markedly increase the ratio of reviewed documents that are released to the public, e.g. +30% for collections with high levels of sensitivity, compared to reviewing by shortest document first. This, in turn, increases the total number of documents that are opened to the public within a fixed reviewing time budget, e.g. an extra 200 documents in 100 hours reviewing
Active Learning Strategies for Technology Assisted Sensitivity Review
Government documents must be reviewed to identify and protect any sensitive information, such as personal information, before the documents can be released to the public. However, in the era of digital government documents, such as e-mail, traditional sensitivity review procedures are no longer practical, for example due to the volume of documents to be reviewed. Therefore, there is a need for new technology assisted review protocols to integrate automatic sensitivity classification into the sensitivity review process. Moreover, to effectively assist sensitivity review, such assistive technologies must incorporate reviewer feedback to enable sensitivity classifiers to quickly learn and adapt to the sensitivities within a collection, when the types of sensitivity are not known a priori. In this work, we present a thorough evaluation of active learning strategies for sensitivity review. Moreover, we present an active learning strategy that integrates reviewer feedback, from sensitive text annotations, to identify features of sensitivity that enable us to learn an effective sensitivity classifier (0.7 Balanced Accuracy) using significantly less reviewer effort, according to the sign test (p < 0.01 ). Moreover, this approach results in a 51% reduction in the number of documents required to be reviewed to achieve the same level of classification accuracy, compared to when the approach is deployed without annotation features
LabLineup: An Intuitive Web Application for Queueing Help Requests in Academic Labs
LabLineup is a start-up project designed to address the needs of large, lab-based classes, where teaching assistants (TAs) struggle to assist students in the order in which they request help. LabLineup is a lightweight web application that allows TAs and professors to accept requests for help in order. LabLineup allows professors to view the requests for a lab and see frequently asked questions that can be addressed en masse rather than individually. LabLineup also allows students to provide TA feedback. LabLineup addresses issues arising from requesting help in large, lab-based classes in the most efficient manner possible
Towards a classifier for digital sensitivity review
The sensitivity review of government records is essential before they can be released to the official government archives, to prevent sensitive information (such as personal information, or that which is prejudicial to international relations) from being released. As records are typically reviewed and released after a period of decades, sensitivity review practices are still based on paper records. The transition to digital records brings new challenges, e.g. increased volume of digital records, making current practices impractical to use. In this paper, we describe our current work towards developing a sensitivity review classifier that can identify and prioritise potentially sensitive digital records for review. Using a test collection built from government records with real sensitivities identified by government assessors, we show that considering the entities present in each record can markedly improve upon a text classification baseline
A Study of SVM Kernel Functions for Sensitivity Classification Ensembles with POS Sequences
Freedom of Information (FOI) laws legislate that government documents should be opened to the public. However, many government documents contain sensitive information, such as confidential information, that is exempt from release. Therefore, government documents must be sensitivity reviewed prior to release, to identify and close any sensitive information. With the adoption of born-digital documents, such as email, there is a need for automatic sensitivity classification to assist digital sensitivity review. SVM classifiers and Part-of-Speech sequences have separately been shown to be promising for sensitivity classification. However, sequence classification methodologies, and specifically SVM kernel functions, have not been fully investigated for sensitivity classification. Therefore, in this work, we present an evaluation of five SVM kernel functions for sensitivity classification using POS sequences. Moreover, we show that an ensemble classifier that combines POS sequence classification with text classification can significantly improve sensitivity classification effectiveness (+6.09% F2) compared with a text classification baseline, according to McNemar's test of significance
The biological availability of cobalt to ruminants
The aetiology of cobalt deficiency in ruminants is not clear and this study focussed on a crucial step in cobalt metabolism, namely its incororation, by rumen microbes, into the essential vitamin B12. Continuous cultures of rumen microbes (Rusitec) were used to study factors influencing vitamin B12 biosynthesis.An improved technique for estimating cobalamin (true vitamin B12) in biological fluids rich in vitamin B12 analogues was developed.Cobalamin and analogue production was increased, from the low levels encountered with cobalt deficient hays and barley, by infusing inorganic cobalt into the cultures. The utilisation of cobalt from cobalt deficient barley in the synthesis of cobalamin was more efficient than that from cobalt-deficient and cobalt-adequate hays, contrary to the generally held ideas. For any substrate, the synthesis of vitamin B12 analogues was far more efficient than that for cobalamin. Inorganic cobalt was incorporated into cobalamin and vitamin B12 analogues far less efficiently than that from either or hay and barley. Cobalt from four soils substantially increased cobalamin and particularly analogue synthesis. The use of monensin, a fermentation manipulator, had greater effects on digestibility and VFA production than on vitamin B12 synthesis
Management education within a cultural confluence : Twinning programmes in Malaysia
In its steady advance toward Vision 2020, Malaysia has been working hard to balance the skills and competencies of its people commensurate with its economic growth. To this end, educational twinning programmes abound in Malaysia. At present, these programmes are inevitably reliant to a considerable degree on Western thought and science. This paper, written from the perspective of two twinning programme directors, examines cross-cultural management issues underlying educational programme design and delivery. Its focus concentrates on the execution of commerce and business programmes
Ethical Principles for the Conduct of Research in the North
Since the publication of the Ethical Principles in 1982, they have proven their worth by becoming the most widely cited and adopted among northern researchers in Canada. Since then, however, the situation in the North has changed significantly. Many First Nations, the Inuvialuit, and the Inuit have settled land claims and, in many cases, related Self-Government Agreements. Land and other regimes have altered. Researchers now find the research context shifting, often unpredictably. Communities have sometimes found themselves and their concerns disregarded by researchers. A renewed research relationship has been called for and is emerging. A new spirit of partnership between northerners and researchers is emerging in northern research. Of course, the nature of any particular partnership will depend on the specific project. The new partnership ethic, however, emphasizes the need to create meaningful relationships with the people and communities affected by research. Another change is the increasing involvement of northerners not
only as subjects or passive observers of research but in all aspects of the research process. Northerners are actively involved in research from conception to reporting, from funding to licensing. For all
parties to benefit fully from research partnerships, mutual understanding is critical. High quality research depends both on communities understanding the needs and concerns of researchers and on researchers understanding the needs and concerns of communities. Guidelines, or principles, are needed to provide a foundation for
and to foster a mutual understanding of community and researcher needs and goals and to ensure that research is carried out with the least friction and social disruption and the most co-operation and support. The 20 principles presented here are intended to encourage the development of co-operation and mutual respect between researchers and the people of the North. They are also intended to encourage partnership between northern peoples and researchers that, in turn, will promote and enhance northern scholarship
Theta characteristics and the fixed locus of [-1] on some varieties of Kummer type
We study some combinatorial aspects of the fixed loci of symplectic
involutions acting on hyperk\"ahler varieties of Kummer type.
Given an abelian surface with a -polarization , there is an
isomorphism between a hyperk\"ahler
of Kummer type that parametrizes length points on and one that
parametrizes degree line bundles supported on curves in ,
where is a -polarization on . We examine the
bijection this isomorphism gives between isolated points in the fixed loci of
when is odd, which has a combinatorics related to theta
characteristics.
Along the way, we give numerical values for a formula of \cite{KMO} counting
the number of components of a symplectic involution acting on a Kummer-type
variety.Comment: 22 pages, 2 figure
- …